Dynamic Feature Description in Human Action Recognition

Authors

  • Ruoyun Gao
  • Michael S. Lew
  • Ling Shao
Abstract

Acknowledgement. I wish to thank all those who helped me. Without them, I could not have completed this project. Firstly, I wish to express my deepest gratitude to Dr. Michael S. Lew for his valuable suggestions and unwavering support during my research project and graduation project. Without his encouragement and trust, I could not have finished my project so smoothly. I would also especially like to thank my company supervisor, Dr. Ling Shao, who gave me the great opportunity to work in Video Processing & Analysis, Philips Research, and provided insightful guidance and continuous encouragement during my project. Many thanks to all my colleagues who helped, supported and accompanied me in overcoming the difficulties in my project.

Abstract. This thesis presents novel description methods for human action recognition. Generally, a video sequence can be represented as a collection of spatial-temporal words by detecting space-time interest points and describing the unique features around the detected points (Bag of Words representation). Interest points, as well as the cuboids around them, are considered informative for feature description in terms of both the structural distribution of the interest points and the information content inside the cuboids. Our proposed description approaches build on this idea and aim to make the feature descriptors more discriminative.

In this thesis, we propose and validate three types of description methods. The first is the projected 3D Shape Context (projected 3DSC), which is derived from the original shape context and makes use of the structural distribution of interest points. The second is a group of transform-based description methods, which are widely used in image processing and utilize the appearance information of images. The third is the Correlogram of Oriented Gradient (COG), which is built from the spatial-temporal gradients of each cuboid and takes advantage of both the spatial structure and the appearance information. The proposed methods are tested on the very challenging and well-studied KTH dataset. The projected 3DSC improves the classification accuracy by 10% compared to the original 3DSC. The Wavelet Transform based descriptor achieves a recognition rate as high as 93.89%, which is better than any of the state-of-the-art methods. The Correlogram of Oriented Gradient performs 15% better than the popular Histogram of Oriented Gradient (HOG). Furthermore, we validate the efficiency of the Wavelet Transform based method on a more realistic and challenging human action dataset: the Hollywood dataset.
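The Bag of Words representation mentioned in the abstract can be illustrated with a minimal sketch. It assumes that a descriptor has already been computed for each cuboid and that a codebook of spatial-temporal words is available (for example from clustering); the function name `bag_of_words_histogram`, the toy codebook and the nearest-codeword assignment by Euclidean distance are illustrative assumptions, not the thesis's actual pipeline.

```python
import numpy as np

def bag_of_words_histogram(descriptors, codebook):
    """Map cuboid descriptors to their nearest codeword and count the words.

    descriptors: array of shape (n_points, dim), one row per interest point.
    codebook:    array of shape (n_words, dim), one row per visual word.
    Both are hypothetical inputs; how they are produced is outside this sketch.
    """
    descriptors = np.asarray(descriptors, dtype=float)
    codebook = np.asarray(codebook, dtype=float)
    # Squared Euclidean distance between every descriptor and every codeword.
    diffs = descriptors[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=2)
    # Index of the closest codeword for each descriptor.
    words = np.argmin(dists, axis=1)
    # Word counts, normalized so that videos with different numbers of
    # interest points remain comparable.
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist

# Toy example: three 2-D descriptors and a two-word codebook.
example = bag_of_words_histogram([[1, 0], [0, 1], [9, 10]], [[0, 0], [10, 10]])
```

The resulting histogram is the fixed-length vector that a classifier would consume; any of the descriptors named in the abstract could in principle supply the rows of `descriptors`.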

Similar resources

Supervised Feature Extraction of Face Images for Improvement of Recognition Accuracy

Dimensionality reduction methods transform or select a low-dimensional feature space to efficiently represent the original high-dimensional feature space of the data. Feature reduction is an important step in many pattern recognition problems across different fields, especially in the analysis of high-dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...

Analysis and Synthesis of Facial Expressions by Feature-Points Tracking and Deformable Model

Facial expression recognition is useful for designing new interactive devices that offer new ways for humans to interact with computer systems. In this paper we develop a facial expression analysis and synthesis system. The analysis part of the system is based on the facial features extracted from facial feature points (FFP) in frontal image sequences. Selected facial feature poi...

Face Recognition Based Rank Reduction SVD Approach

Standard face recognition algorithms that use standard feature extraction techniques always suffer from image performance degradation. Recently, singular value decomposition and low-rank matrices have been applied in many applications, including pattern recognition and feature extraction. The main objective of this research is to design an efficient face recognition approach by combining many tech...

Robot Arm Performing Writing through Speech Recognition Using Dynamic Time Warping Algorithm

This paper aims to develop a writing robot that recognizes the speech signal from the user. The robot arm is constructed mainly for disabled people who cannot write on their own. Here, the dynamic time warping (DTW) algorithm is used to recognize the speech signal from the user. The action performed by the robot arm in the environment is done by reducing the redundancy which frequently fac...

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by enabling natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

Journal:
  • CoRR

Volume abs/1101.0234

Publication date 2009